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Abstract 



Complementation and determinization are two fundamental notions in automata theory. 
The close relationship between the two has been well observed in the literature. In the case of 
nondeterministic finite automata on finite words (NFA) , complementation and determinization 
have the same state complexity, namely G(2") where n is the state size. The same similar- 
ity between determinization and complementation was found for Biichi automata, where both 
operations were shown to have 2^("'s"-) state complexity. An intriguing question is whether 
there exists a type of w-automata whose determinization is considerably harder than its comple- 
mentation. In this paper, we show that for all common type w-automata, the determinization 
problem has the same state complexity as the corresponding complementation problem at the 
granularity of 2^^'^. 

1 Introduction 

Automata on infinite words (w-automata) have wide apphcations in synthesis and verification of 
reactive concurrent systems. Complementation and determinization are two fundamental notions 
in automata theory. The complementation problem is to construct, given an w-automaton A, an 
w-automaton B that recognizes the complementary language of A. Provided that a given A is 
nondeterministic, the determinization problem is to construct a deterministic w-automaton B that 
recognizes the same language as A does. 

The close relationship between determinization and complementation has been well observed 
in the literature (see |Cho741 IVarOTj for discussions). A deterministic cj-automaton can be trivially 
complemented by dualizing its acceptance condition. As a result, a lower bound on complementa- 
tion applies to determinization while an upper bound on determinization applies to complementa- 
tion. Besides easily rendering complementation, determinization is crucial in decision problems for 
tree temporal logics, logic games and system synthesis. For example, using game-theoretical se- 
mantics |GH82j . complementation of w-tree automata (w-tree complementation) not only requires 
complementation of w-automata, but also requires the complementary w-automata be deterministic 
(called co-determinization). Therefore, lowering the cost of w-determinization also improves the 
performance of w-tree complementation. 

In the case of nondeterministic finite automata on finite words (NFA), complementation and 
determinization have the same state complexity, namely 0(2") where n is the state size. The 
same similarity between determinization and complementation was found for Biichi automata, 
where both operations were shown to have 2®("'s") state complexity. An intriguing question is 
whether there exists a type of w-automata whose determinization is considerably harder than its 
complementation. In this paper, we show that for all common type w-automata, the determinization 



problem has the same state complexity as the corresponding complementation problem at the 
granularity of 2®^') (see Figure [1]). 

Related Work. Biichi started the theory of w-automata. The original cj-automata used to 
establish the decidability of SIS |Buc66j are now referred to as Biichi automata. Shortly after 
Biichi's work, McNaughton proved a fundamental theorem in the theory of w-automata, that is, uj- 
regular languages (the languages that are recognized nondeterministic Biichi automata) are exactly 
those recognizable by the deterministic version of a type of a;-automata, now referred to as Rabin 
automata [McN66j . 

The complexity of McNaughton's Biichi determinization is double exponential. In 1988, Safra 
proposed a type of tree structures (now referred to as Safra trees) to obtain a Biichi determinization 
that converts a nondeterministic Biichi automaton of state size n to an equivalent deterministic Ra- 
bin automaton of state size 2'-^("^§") and index size 0(n) [Saf88j. Safra's construction is essentially 
optimal as the lower bound on state size for Biichi complementation is 2'^("^s") |Mic88[ [Lod99] . 
Later, Safra generalized the Biichi determinization to a Streett determinization which, given a 
nondeterministic Streett automaton of state size n and index size k, produces an equivalent de- 
terministic Rabin automaton of state size 

20(nk\gnk) g^^^ jj^^g^ g-^g Q^^^^ [Saffi2]. A variant 
Biichi determinization using a similar tree structure was proposed by Muller and Schupp |MS95] . 
Recently, Piterman improved both of Safra's determinization procedures with a node renaming 
scheme [Pit06j . Piterman's constructions are more efficient than Safra's, though the asymptotical 
bounds in terms of 2^^'^ are the same. A big advantage of Piterman's constructions, however, is 
to output deterministic parity automata, which is easier to manipulate than deterministic Rabin 
automata. For example, there exists no efficient procedure to complement deterministic Rabin 
automata to Biichi automata |SV89j . while such complementation is straightforward and efficient 
for deterministic parity automata. 

In |CZL09l ICZllal ICZllbj we established tight bounds for the complementation of w-automata 
with rich acceptance conditions, namely Rabin, Streett and parity automata (see Figure [1]). The 
state complexities of the corresponding determinization problems, however, are yet to be settled. 
In particular, a large gap exists between the lower and upper bounds for Streett determinization. A 
Streett automaton can be viewed as a Biichi automaton (Q, S, Qq^ A, F) except that the acceptance 
condition T = {G, B)j, where / = [l..k] for some kandG,B : I ^ 2^, comprises k pairs of enabling 
sets G{i) and fulfilling sets B{i). A run is accepting if for every i € /, if the run visits G{i) infinitely 
often, then so does it to B{i). Therefore, Streett automata naturally express strong fairness con- 
ditions that characterize meaningful computations |FK841 lFra86| . Moreover, Streett automata are 
exponentially more succinct than Biichi automata in encoding infinite behaviors of systems [SV89] . 
As a results, Streett automata have an advantage in modeling the behaviors of concurrent and 
reactive systems. For Streett determinization, the gap between current lower and upper bounds is 
huge: the lower bound is 2^("''g") [CZllaj and the upper bound is 2'^("'''s"^) |Saf92l iPitOG] when 
k is large (say k = uj{n)). 

In this paper, we show a Streett determinization whose state complexity matches the one for 
Streett complementation. More precisely, our construction has state complexity 2*-'("'§"+"'^^s'=) for 
k = 0{n) and 2*-^^" '^"^ for k = uj(n). We note that this improvement is not only meant for large 
k; when k = O(logn), the difference between 2'^("-^s"+"''^^s'^) and 2'^("'^'^s"''^) is already substantial. 
The phenomenon that determinization and complementation have the same state complexity does 
not stop at Streett automata; we also show that this phenomenon holds for generalized Biichi 
automata, parity automata and Rabin automata. This raises a very interesting question: do deter- 
minization and complementation always "walk hand in hand"? Although the exact complexities 



2 



Type 


Bound 


Lower 


Upper 


Biichi 


26 (n Ig n) 


[Mic88] 


|Saf88| 


Generalized Biichi 


2e(nignfc) A; = 0(2") 


|Yan06j 


|KV05j 


Streett 


2@(nlgn+nklgk) ^ _ Q(j^'^ 

2e{nHgn) k = uo{n) 


|CZlla| 


|CZllb| 


Rabin 


2e(nfclgn) /t = 0(2") 


[CZL09] 


|KV05j 


Parity 


2B(nlgn) k = 0{n) 


[Mic88] 


|CZllb| 



Figure 1: Complementation and determinization complexities for w-automata of common types. 
The listed citations are meant for complementation only. 

of complementation and determinization for some or all types of w-automata could be differen10, 
the "coincidence" at the granularity of 2®^') is already intriguing. 

Our Approaches. Our improved construction bases on two ideas. The first one is what we 
have exploited in obtaining tight upper bounds for Streett complementation jCZllbj . namely, the 
larger the size of Streett index size, the higher correlations in the runs of Streett automata. We 
used two tree structures: ITS {Increasing Tree of Sets) and TOP {Tree of Ordered Partitions) to 
characterize those correlations. We observed that there is a similarity between TOP and Safra trees 
for Biichi determinization [Saf88]. As Safra trees for Streett determinization are generalization of 
those for Biichi determinization, we conjectured that ITS should have a role in improving Streett 
determinization. Our study confirmed this expectation; using ITS we can significantly reduce the 
size of Safra trees for Streett determinization. 

The second idea is a new naming scheme. Bounding the size of Safra trees alone cannot bring 
down the state complexity when the Streett index is small (i.e., k = 0{n)), because the naming 
cost becomes a dominating factor in this case. Naming is an integral part of Safra trees. Every 
node in a Safra tree is associated with a name, which is used to track changes of the node between 
the tree (state) and its successors. The current name allocation is a retail-style strategy; when a 
new node is created, a name from the pool of unused names is selected arbitrarily and assigned, 
and when a node is removed, its name is recycled to the pool. In contrast, our naming scheme 
is more like wholesale; the name space is divided into even blocks and every block is allocated at 
the same time. When a branch is created, an unused block is assigned to it, and when a branch is 
changed, the corresponding block is recycled. 

Paper Organization. Section [2] presents notations and basic terminology in automata theory. 
Section [3] introduces Safra's construction for Biichi and Streett determinization. Section [4] presents 
our construction for Streett determinization. Section [5] establishes tight upper bounds for the 
determinization of Streett, generalized Biichi, parity, and Rabin automata. Section [6] concludes 
with some discussion on future work. All proofs are omitted from the main text, but they can be 
found in the appendix. 

'^Recent work by Colcombet and Zdanowski, and by Schewe showed that the state complexity of determiniza- 
tion of Biichi automata on alphabets of unbounded size is between n((1.64n)") |CZ09| and 0((1.65n)") fSch09bl, 
which is strictly higher than the state complexity of Biichi complementation which is between f2(L(n)) Yan06 and 
0{n^ L{n)) [Sch09a] (where L{n) « (0.76n)")). However, Schewe's determinization construction produces Rabin 
automata with exponentially large index size, and it is not known weather Colcombet and Zdanowski's lower bound 
result can be generalized to Biichi automata on conventional alphabets of fixed size. 
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2 Preliminaries 



Basic Notations. Let N denote the set of natural numbers. We write for {fc G N | i < k < 
j}, for — 1], and [n] for [0..n). For an infinite sequence g, we use g{i) to denote tlie i-th. 
component for i eN. For a finite sequence a, we use |a| to denote the length of a, a[i] {i G [1..|q;|]) 
to denote the object at the i-th. position, and a[i..j] (resp. a[l..j)) to denote the subsequence of a 
from position i to position j (resp. j — 1). We reserve n and k as parameters of a dctcrminization 
instance (n for state size and k for index size), and define n = min(ra, k) and I = [l..k]. 

Automata and Runs. A finite automaton on infinite words (w-automaton) is a tuple A = 
(S, Q, Qq, a, J-") where S is an alphabet, Q is a finite set of states, Qq Q Q is a set of initial states, 
ACQxSxQisa set of transition relations, and J- is an acceptance condition. 

A finite run of A from state q to state q' over a finite word w is a sequence of states g = 
g{0)g{l) ■ ■ ■ g{\w\) such that g{0) = q, g{\w\) = q' and {g{i),w{i), g{i + 1)) G A for all i G [\w\]. An 
infinite word (a;- words) over S is an infinite sequence of letters in S. A run ^> of ^ over an w-word 
w is an infinite sequence of states in Q such that ^(0) G Qo and, {g{i),w{i), g{i+l)) G A for i G N. 
Let Inf{g) be the set of states that occur infinitely many times in g. An automaton accepts w if 
g satisfies T, which usually is defined as a predicate on Inf{g). By ^{A) we denote the set of 
w-words accepted by A. 

Acceptance Conditions and Types, w-automata are classified according their acceptance 
conditions. Below we list automata of common types. Let G,B be two functions 7 — )■ 2*^. 

• Generalized Biichi: {B)i: Vi G /, Inf{g) n B{i) ^ 0. 

• Biichi: {B)i with / = {1} (i.e., k = 1). 

• Streett, {G, B)i: G /, Inf{g) n G{z) Inf{g) n B{t) ^ 0. 

• Parity, {G,B)i with B{1) c G(l) C • • • C B{k) C G{k). 

• Rabin, [G, B]i: 3i G /, Inf{g) n G{i) / A Inf{g) n B{i) = 0. 

For simplicity, we denote a Biichi condition by F (i.e., F = -B(l)) and call it the final set of A- 
Note that Streett and Rabin conditions are dual to each other, and Biichi, generalized Biichi and 
parity automata are all subclasses of Streett automata. For a Streett condition {G,B)j, if there 
exist i,i' G /, B{i) = B{i'), then we can simplify the condition by replacing both {G{i),B{i)) 
and {G{i'),B{i')) by {G{i) Li G{i'),B{i)). For this reason, for every Streett condition {G,B)i, we 
assume that B is injective and = |I| < 2". 

Trees. A tree is a set C N* such that ii v ■ i G V then v G V and v ■ j G V for j < i. In this 
paper we only consider finite trees. Elements in V are called nodes and e is called the root. Nodes 

V ■ i are children of v and they are siblings to one another. A node v ■ i is said to be older then 

V ■ j if i < j. The set of v^s children is denoted by ch{v). A node is a leaf if it has no children. 
Given an alphabet S, a T,-labeled tree is a pair {V,L), where F is a tree and L : V ^ T, assigns 
a letter to each node in V. We refer to G V as V-value (structural value) and L{v) as L-value 
{label value) of v. An ordered and labeled tree is a tuple {V, L, -<) where {V, L) is a labeled tree 
and ^ is a partial order on V. 
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3 Safra's Determinization 



In this section we introduce Safra's constructions for Biichi and Streett determinization |Saf88t 
[SidF92] . 

By the nature of determinization, it is not surprising that all constructions rely on certain types 
of subset construction. For Biichi automata, this subset construction takes a particular form. Let 
A = {Q,Qo,T,, A, F) be a nondeterministic Biichi automaton and B a purported deterministic oj- 
automaton such that ^{A) = ^(B). It is not hard to build B such that if a run g = qQ,qi, . . . G 
of A over an infinite word w is accepting, then so is the run Q = qo,Qi, ■ ■ ■ G (2*^)'^ of B over w, 
obtained by the standard subset construction [RS59j . and qi G qi for every i > 0. This gives us 
^{A) C ^[B). The harder part is to also guarantee ^{B) C ^(A), that is, an accepting run g of 

B over w induces an accepting run g of A over w. Let S ^ S' denote that for every state q' & S', 
there exists a state q € S and a finite run p oi A such that p goes from q to q' and visits F. The 
key idea in many determinization constructions |McN66[ ISaf88l ISaf92[ IMS95[ IPitOG] relies on the 
following lemma, which itself is a consequence of Konig's lemma. 

Lemma 1 ( |McN66t ISaf88| ). Let g = qo,qi, . . . G (2^)'^ and {Si)uj = So, Si, . . . an infinite subse- 

F 

quence of g such that for every i > 0, Si ^ Si^i, then there is an accepting run g = qo,qi, ■ ■ ■ G 
of A such that q^ G iji for every i > 0. 

Safra proposed an essentially optimal construction for Biichi determinization j Saf88] . The 
ingenuity of his construction is to efficiently organize subsets of states using a type of tree structures, 
now referred to as Safra trees. In a Safra tree, each node is equipped with a set label, a subset of Q. 
We simply say that a node contains the states in its set label. The standard subset construction 
is carried out on all nodes in parallel, and each node v in the tree gives birth to a new child which 
contains the states both in F and in v. We say that a node v turns green if every state in v appears 

in a child of v, and vice versa. We have S ^ S' where S and S' are the set labels of v at two 
consecutive moments of v being green. If v turns green infinitely often, then we have an desired 
{Si)ui as stated in Lemma [H 

A key ingredient in Safra's construction is to assign names and colors to nodes, in order to track 
changes on nodes to identify those that turn green infinitely many times. For the sake of presen- 
tation clarity, we separate the naming and coloring convention from the core of the construction, 
the transition function (or the tree transformation rule). In this paper, use three colors: green, red 
and yellow. 

Procedure 1 (Naming and Coloring Convention). The following convention of naming and col- 
oring is applied in the transition function (see Procedure\^. 

1. Newly created nodes are marked red. 

2. Nodes whose all descendants are removed are marked green. 

3. Nodes are marked yellow unless they have been marked red or green. 

4. Nodes with the empty set label are removed. 

5. When a node is created, a name from the pool of unused name is selected and assigned to the 
node; when a node is removed, its name is recycled back to the pool. 

Procedure 2 (Safra Determinization Scheme). Given A = {Q,Q(),Ti, A,J^) as a nondeterministic 
automaton, this procedure outputs a deterministic Rabin automaton T> = {Q,qQ,T,, A, [G,B]j) such 
that 
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Q is the set of Safra trees, 
qo Q is the tree with just the root. 
{^3 q' = A(g) is defined as follows. 

\^3i l Expansion; Spawn new children. 

\^31 2 Subset Construction. Update the set label of each node using the subset construction. 
[23.5 Horizontal Merge. Make sure that set labels of any two siblings are disjoint. 
VA°J[ 4 Vertical Merge. Remove all descendants of a node if the node turns green. 

[G, B]j is such that for every i & I 

G{i) = {q ^ Q \ Q contains a red node with name i or does not contains a node with name i } 
Bii) = {q Q \ q contains a green node with name i } 

From now on, we concentrate on defining transition functions; this amounts to customizing 
Step (|2l3p for Biichi and Streett automata, respectively. 

Biichi Determinization. 

Definition 1 (Safra Trees for Biichi Determinization (STB)). A Safra tree for Biichi determiniza- 
tion (STB) is an ordered and labeled tree {V, L, ~<str) with L = {Ln, Lg, Lc) where 

U^l L„ : y — 7> [l..n] assigns each node a unique name. 

Lg : V ^ 2^ assigns each node a subset of Q such that for every node v, [Jyi^ch{v) ^s{v') C 
Ls{v) and Ls{v') Ls[v") = for every two distinct v',v" G ch{v). 

[215 Lc : y — )■ {green, red, yellow} assigns each node a color. 

''^str- V xV linearly orders siblings such that v -<str v' iff v is to the left of v' . 

We note that there are several subtleties in this definition. First, nodes are identified with 
their names instead of their y- values (structural values); that is exactly the purpose of naming. 
When we say that a node u in a tree t and its successor t' (i.e., t' = A(t)), we actually mean that 
two nodes with the same name, one in t and the other in t' , and their corresponding V-values do 
not matter. But for simplicity, we keep using ^/-values to refer to nodes. We shall address an 
exception, however, in our construction for Streett determinization where due to renaming, the 
"same" node v can have different names in t and t' (Section U]). Second, we require that newly 
created nodes be added to the right of all existing siblings. That is, v -<str v' iff v is older than v' . 
Formally, this would require defining the structural ordering -<str between nodes in a tree t and its 
successor t' . But we avoid this complication as the meaning is clear from the context. Third, '^str 
is in fact of no importance in Safra's original construction; it is totally unnecessary to require that 
younger siblings be put to the right of older ones. What matters is that this older-than relation 
determines the priority of state ownership between sibling nodes at the stage of horizontal merge 
(Step (|2[313p ). But we shall explain in Section H] why -<str is needed to get the desired complexity 
result for our construction. We choose to introduce the relation here for a uniform presentation of 
all determinization constructions in this paper. 
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Procedure 3 (Biichi Determinization |Saf88j ). Given a nondeterministic Biichi automaton A = 
{Q, Qq, S, a, F), this procedure outputs a deterministic Rabin automaton T> = {Q, %, S, A, [G, B]j), 
where Q is the set of STB, I = [l..n], qo and [G,B]j are as defined in ProcedurelE (for the root v 
in % we have Ln{v) = 1, Ls{v) = Qo, and = red), and A is defined such that q' = A(g) if and 
only if q' is the STB obtained by applying the following transformation rule to q. 

Expansion. For each node v in q, add a new child v' with label Ls{v) D F. 

Subset Construction. For each node v in Q, update set label Ls{v) to A{Ls{v)). 

[3.5 Horizontal Merge. For any state q and any two siblings v and v' such that q G Ls{v)r]Ls{v'), 
if V -<str v' , then remove q from v' and all its descendants. 

Vertical Merge. For every node v, if Ls{v) = ^v'^ch{v)^sW), then remove all descendants of 

V. 

Streett Determinization. Safra generalized the construction to Streett determinization |Saf92] . 
Let A = {QjQojT,, A, {G,B)i) be a Streett automaton. A J Q I serves as a witness set for an 
accepting run g of A in the following sense: for every j in J, g visits B(j) infinitely often while for 
every j I \J, g visits G{j) only finitely many times. We call indices in / \ J negative obligations 
and indices in J positive obligations. It is easily seen that a run g is accepting if and only if g 
admits a witness set J Q L We say that a finite run p fulfills J if for every j in J, p visits B(j), but 

for every j ^ I \ J, p does not visit G{j). By 5 ^ S" we mean that every state in S' is reachable 
from a state in S via a finite run p that fulfills J. 

Lemma 2 ( |Saf92| ). Let g = qo,qi, ■ ■ ■ € (2'^)'^ and = So,Si,... an infinite subsequence of g 

such that for every i >0, Si ^ Si+i, then there is an accepting run g = qQ,qi, . . . € Q'^ of A such 
that qi £ (ji for every i >0. 

However, there are 2^ potential witness sets and a naive implementation would give a double 
exponential construction. Safra's idea is to organize both witness sets and state sets in a tree where 
each node is labeled by a witness set as well as by a state set. We illustrate the idea using an 
example. Let k = 3 and / = [1..3]. 

The initial state in the deterministic automaton S is a root v labeled with / and Qq. The 
obligation of v is to detect runs that fulfills /, i.e., visit B{1), B{2), and -6(3) infinitely often. Once 
a run visits -6(3) (fulfilling a positive obligation), the run moves to a new child, waiting to visit 
-6(2), and once the run visits -6(2), the run moves to anther new child, waiting to visit -6(1), and 
so on. Technically speaking, states are moved around, which induces moving of runs. 

This sequential sweeping can be stalled because some runs in v, from some point on, could never 
visit -6(3). So V should spawn a child ^3 with the witness set / \ {3} = {1,2}, to detect runs that 
fulfill {1,2} (see Figure [2al) . A run in ^3 should never visit G{3), its negative obligation. Or the 
run will be reset (the exact meaning of reset is shown in Step ()4[2[312p ). If a run visits -6(3), then 
the run should be moved into a child V2 with the witness set / \ {2} = {1, 3}. Similarly, we may 
have a child vi associated with the witness set / \ {1} = {2, 3}. Note that this spawning should be 
recursively applied until we arrive at leaves whose witness sets are singletons. For example, runs 
staying in V3 could never visit -6(2), and hence V3 should also spawn a child V32 with the witness 
set /\ {3, 2} = {1}. In general, two nodes, even if they are siblings, may have the same witness set 
as they are created at different times. 
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v. {1,2,3} 



V3 ■■ {1,2} 



V2 ■■ {1,3} 



VI : {2,3} 



vo : {1,2,3} 



^;32:{1} {2} {1,2,3} {1} {3} {1,2,3} {2} {3} {1,2,3} {1,2} {1,3} {2,3} {1,2,3} 

(a) 
V : 



U3 : 3 V2 '■ 2 vi : I vq : 



^;32:2103103203210 
(b) 



Figure 2: Two equivalent representations of witness sets. In (a) nodes are explicitly labeled with 



witness sets while in |(b)| witness sets are implicitly represented by index labels. 



Let v' be a node with the witness set J. If a run fulfills J, then the run moves into a special 
child that also has / as its witness set. If all children of v' are special, then we say v' turns green. 
It is not hard to verify that if S and S' are the state sets of v' at any two consecutive moments 

when v' turns green, then we have S ^ S' . But v' may never turn green because not all runs fulfill 
J. That is why these special children of v' also have / as their witness sets, for they also spawn 
children, behaving just as the root. We refer the reader to |Sch02] for a detailed exposition of this 
spawning and sweeping process. 

Figure [2] shows part of a Safra tree for Streett determinization in two equivalent representations 
(with respect to witness sets). The representation shown in Figure [2b] is used in Definition [2j Note 
that, in general, not all witness sets necessarily appear in a Safra tree. 

Definition 2 (Safra Trees for Streett Determinization [STS)). A Safra tree for Streett deter- 
minization (STS) is an ordered and labeled tree {V, L, ~<str) with L = {Ln, Lg, Lc, Lh) where 

Ln : V [l..nk] assigns each node a unique name. 

Ls :V ^ 2'^ assigns each node a subset ofQ such that for every node v, Ls{v) = y^v'&ch{v)LsW) 
and Ls{v') Ls{v") = for every two distinct v',v" € ch{v). 

Lc'.V ^ {green, red, yellow} assigns each node a color. 

\M4 Lh : V —!' I L) {0} assigns each node an index in I U {0}. For a node v, let L'f^{v) denote the 
sequence of I -elements from the root to v with excluded and Lf^^{v) the set of I -elements 
occurring in Lj^{v). We require that for every node v, there is no repeated index in the 
sequence Lj^{v). 

'^str'- V ^ V linearly orders siblings such that v -^str v' iff v is to the left of v' , and if 
Lh{v) > Lh{v'), then v ^str v' . 

We require that a newly created node be added to the right of all existing siblings of the same 
L/i-value. That is, for two sibling v and v\ v -<str v' iff Lh{v) > Lh{v'), or Lh{v) = Lh{v') and v is 
older than v' . We define a variant sibling ordering -<pri such that v -<pri v' iff Lh{v) < Lh{v'), or 
Lfi{v) = Lfi{v') and v is older than v'. As mentioned before, the ordering ^gtr is of no importance 
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in Safra's determinization constructions, while both -<str and -<pri are needed for the correctness 
of our construction (Section H]). 

Procedure 4 (Streett Determinization |Saf92] ) . Given a nondeterministic Streett automaton A = 
{Q, Qo, S, A, (G, B)j), this procedure outputs a deterministic Rabin automaton B = {Q, %, S, A, [G, B]j) 
where Q is the set of STS, I = [l..nk], % and [G,B]j are as defined in Procedure\E (for the root 
V in qo we have Ln{v) = 1, Ls{v) = Qq, Lc{v) = red and Lh{v) = 0), and A is defined such that 
q' = A(g) if and only if q' is the STS obtained by applying the following transformation rule to q. 

Subset Construction; for each node v in Q, update set label Ls[v) to A{Lg{v)). 
Expansion. Apply the following transformations downwards from the root: 
\4\S\ 1 If V is a leaf with Lf^^{v) = I, stop. 

\4\S\ 2 If V is a leaf with Lf^^(v) ^ I, add a child v' to v such that Ls{v') = Ls{v) and Lh{v') = 
max(/ \ Lf^^^{v)). 

\4\^ 3 If V is a node with j children vi,. . . ,Vj. Let ii, . . . ,ij be the corresponding index labels. 
Consider the following cases for each j' S 

\4\2\3\ 1 If Ls{vji) n B{iy) / 0. Add a child v' to v with Ls{v') = Ls{vji) n B{iy) and 

Lh{v') = max([0..ij')n((/U{0})\L^^*(t;))), and remove the states in Ls{vji)riB(ij') 

from Vji and all the descendants of Vji . 
\4[^ 2 If Ls{vf) n B{ij>) = and Ls{vf) D G{if) / 0. Add a child v' to v with Ls{v') = 

Ls{vji) n G{iji) and Lfi{v') = Lfi{v), and remove the states in Ls{vj') D G{iji) from 

Vji and all the descendants of Vji . 

^3 Horizontal Merge. For any state q and any two siblings v and v' such that q G Ls{v) nLs{v'), 
if V -<pri v' , then remove q from v' and all its descendants. 

Vertical Merge. For each v, if all children ofv have index label 0, then remove all descendants 
ofv. 

Step (|4[2[31ip says that if a run in Vji fulfills the positive obligation iji by visiting B{iji) 
{Ls{vji) n B{iji) 7^ 0), then the run moves into a new node v' {Ls{v') = Ls{vj') fl B{iji)), and v' 
continues to monitor if the run hits the next largest positive obligation in the witness set of its 
parent {Lh{v') = max([0..2j/) n ((/ U {0}) \ L^''*(t;)))). Step (I4l2l3l2p says that if this is not the case 
and the run in Vji also violates the negative obligation iji by visiting G{iji) {Ls{vji) fl G{iji) ^ 0), 
then this run is reset, in the sense that the states in Ls{vji) n G{iji) moved into a new child of v. 



4 Improved Streett Determinization 

In this section, we show our improved construction for Streett determinization. 



Improvement I. The first idea is what we have applied to Streett complementation [CZllb| . 
namely, the larger the /c, the more overlaps between i?(i)'s and between G(i)'s {i G /). Let us 
revisit the previous example, illustrated in Figured Assume that G{2) C G(3) (we say that G(3) 
covers G{2)). If a run stays at ^3, then the run is not supposed to visit G(3), or otherwise the run 
should have been reset by Step (I4121312p . Since the run cannot visit G{2) either, there is no point 
to check if it is to visit -6(2), and hence does not need to have a child with index label 2 (in this 
case the node v^2)- This simple idea already puts a cap on the size of STS. But it turns out that 
we can save the most if we exploit the redundancy on B instead of on G. 
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Reduction of Tree Size. Step (j4[2[31ip at a non-leaf node v is to check, for every child v' of 
V, if a run visits B{Lh(v')), and in the positive case, move the run into a new node. Let v' be 
a non-root node and v the parent of v' . Let ly = Lf^^'^v). As Step ()4[2p is executed recursively 
from top to bottom, it can be assured that at the moment of its arriving at v, for every i £ ly, we 
have Ls{v') fl B{i) = 0. By an abuse of notation, we write B{v) for Uj^j^B{j), and hence we have 
Ls{v') n B{v) = 0. Thus, there is no chance of missing a positive obligation even if we restrict Lh 
to be such that B{Lh{v')) ^ B{v). It follows that each node "watches" at least one more state that 
has not been watched by its ancestors, and therefore along any path of an STS, there are at most 
/i nodes with non-zero index labels (recall that /i = min(n, k) and note that the root is excluded 
as its index label is 0). Also, it can be shown by induction on tree height that an STS contains 
at most n nodes with index label 0, using the fact that set labels of sibling nodes are pairwise 
disjoint and the fact that if v is the parent of v' and Lh[v') = 0, then Ls{v') C Ls{v). Therefore, 
the number of nodes in an STS is bounded by n(/Li -|- 1). 

Reduction of Index Labels. Let I'^ = {i £ I \ B{i) C B{v)}. The above analysis tells us 
that Lh{v') € (/ \ /(,). However, we can further improve such that there is no j G {I \ I'y), 
{B{j) \ B{v)) C {B{Lh{v')) \ B{v)) and for any j G (I \ I^), {B{j) \ B{v)) = {B{Lh{v')) \ B{v)) 
implies Lfi{v') < j. We say that Lfi{v') minimally extends Lj^{v) if this condition holds. 

To formalize the intuition of minimal extension, we introduce two functions Cover :/*—)• 2^ 
and Mini : I* ^ 2^ as in |CZllb] . Cover maps finite sequences of /-elements to subsets of / such 
that 

Coveria) = { j G I | B{j) C |J i?(a[z]) }. 

1=1 

Note that Cover{e) = 0. Mini also maps finite sequences of /-elements to subsets of / such that 
j G Mini{a) if and only if j G / \ Cover (a) and 

V/ G / \ Coveria) (/ ^ j ^ B{f) U \J i?(a[i]) B{j) U |J B{a\i])) , (1) 

i=l i=l 

H l"l 
V/ G / \ Coveria) (j' < j ^ B(j') U |J i?(a[f]) ^ Bij) U J i?(a[i])) . (2) 

i=l 1=1 

Miniia) consists of index candidates to minimally enlarge Coveria); ties (with respect to set 
inclusion) are broken by numeric minimality (Condition ([2])). 

Improvement II. The second idea is a batch-mode naming scheme to reduce name combinations. 
As shown before, an STS can have n(/U -|-1) nodes, which translates to (n(/i -|-1))! name combinations 
according to the current "first-come-first-serve" naming scheme, that is, picking an unused name 
when a new node is created and recycling the name when a node is removed. When k = 0(n), 
the naming cost is higher than all other complexity factors combined, as (n(/i + 1))! = 2'^^^^^^'^^\ 
However, this can be overcome by dividing the name space into even buckets and "wholesaling" 
buckets to specific paths in an STS. 

Reduction of Names. Let t be an STS. A left spine iLS) is a maximal path I = vi - ■ ■ Vm such 
that Vm is a leaf, for any i G [2..m], vi is the left- most child of Vi-i, and vi is not a left-most child 
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of its parent. We call vi the head of /. Let head : V V he such that head{v) = v' if v' is the head 
of the LS where v belongs to. We say that I is the i-th LS of t if Vm is the i-th leaf of t, counting 
from left to right. It is clear that a tree with m leaves has m LS, and every node is on exactly 
one LS. Thus, an STS has at most n LS. Now the reason of using the structural ordering ^str is 
clear. If a non-head node on an LS has index label 0, then so should all of its siblings. But then 
all of them should have been removed. Therefore, only the head of an LS can have index label 0, 
which means that an LS can have at most fi-\- 1 nodes. 

We use + 1) names and divide them evenly into n buckets. Let bi {i G [l--''^]) denote the 
i-th bucket, bij = {fi + l)(i — 1) + j (i G [l..n], j € [l../i + 1]) the j-th name in the i-th bucket. 
We say that bn = + l)(i — 1) + 1 is the initial value of bi. Our naming strategy is as follows. 
Every LS / = ui • • • f in an STS is associated with a name bucket b and fi, • • • ,Vm are assigned 
names continuously from the initial value of b. For example, if / is associated with bucket bt, then 
Ln{vi) = (/X + l)(t — \) + i for i G [l..m]. The bucket association for each LS in an STS t can 
be viewed as selection function bucket : V [l..n] such that bucket{v) = i if node v is assigned a 
name in the i-th bucket. 

This naming strategy, however, comes with a complication; what if a leftmost sibling v is 
removed (due to Step (|4[2[3p ). and the second leftmost sibling v' (if exists) and all nodes belonging 
to the LS of which v' is the head, "graft into" the LS that v belongs to? The answer is that at the 
end of tree transformation, we need to rename those nodes that have moved into another LS. If a 
tree transformation turns t into t' , and during the process a node v joins another LS I in t' (which 
is also in t before the transformation), then we rename v to a name in the bucket that I uses in t, 
and recycle the bucket with which v was associated in t. 

Example 1 (New Naming Scheme). Figure\^ illustrates the changes of names in a sequence of tree 
transformations. We assume that there are 10 buckets bi — 6io, each of which is of size 4. Nodes 
in the graphs are denoted in the form v : Ln{v); all other types of labels are omitted for simplicity. 

Definition 3 (Reduced Safra Trees for Streett Determinization (fiSTS)). A reduced Safra tree for 
Streett determinization (uSTS) is an STS {V,L,^str) with L = {Ln, Lg, Lc, L^) that satisfies the 
following additional conditions: 

Oi Condition on L^. For each node v, if Mini{Lj^ (v)) ^ 0, then v is not a leaf node and for 
any child v' ofv, Lh{v') G Mini{Lj^ {v)) . 

Condition on L„. There exists a function bucket : V — )• [l..n] such that for every LS vi - ■ ■ Vm, 
we have bucket{vi) = bucket{vj) for i,j G [l..m] and Ln{vi) = (/i + l){bucket{vi) — 1) + i for 
i G [l..m]. 

Condition (|312p . together with the requirement that L„ is injective (Condition ()2[ip ). guarantees 
that no two distinct LS in a tree share a bucket. Condition (j31ip says that every fiSTS has fully 
grown left spines, that is, no leaf v can be further extended, as Mini{Lj^ (v)) = 0. To achieve this, 
we need the following procedure applied as the last step of each tree transformation. 

Procedure 5 (Grow Left Spine). Repeat the following procedure until no new nodes can be added: 
if V is a leaf and Mini{L'^{v)) ^ 0, add a new child v' to v with Ls{v') = Ls{v), Lfi{v') = 
max(Mini(L^(v))), Lc{v') = red, and Ln{v') = Ln{v) + 1. 

We note that the requirement that a fiSTS has fully grown left spines is not essential; we can 
"grow" a fiSTS "on-the-fly" as in |Saf921 [Pit06| . But this requirement simplifies the analysis on 
the number of combinations of index labels (see the proof of Theorem [2]). 
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Vo ■■ 1 



Vo ■■ 1 



vi : 2 



tig : 21 Vn : 13 wi : 2 



Vg : 21 wii : 13 



V2 ■■ 3 



W5 : 25 Vg : 22 wiq : 33 



U5 : 3 ug : 22 uig : 33 



Vs : 4 V4 : 17 Vq : 26 vj : 5 

(a) 



Wo : 1 



Vq : A vj : 5 



(b) 



vi : 2 



Vg : 21 



vn ■■ 13 



Vo ■ 1 



^'5 



Vi2 : 25 wg : 22 uio : 33 Vu '■ 29 



Vg : 2 



Vn ■■ 13 



Vq : 4 i>7 : 5 U13 : 26 



wg : 3 vio : 33 t;i4 : 29 
(d) 



Figure 3: (a) shows an STS with 7 LS: h : fofiV2i'3! ^2 : V/j^, Is : fsfe, h ■ v^, : vsvg, Iq : fio and 

&7) &2) ^6) ^9 and ^4, respectively. |(b)| shows the resulting 



Ij : Vu, associated with buckets 61, 65, 
STS* after removing V2 and its descendants. Nodes V5 and ?;6 migrate into h, and accordingly their 



name bucket 67 is recycled. Also recycled is bucket 65 due to the deletion of ^4. (c) shows the 
resulting STS after adding nodes V12, V13 and vu and forming two new LS: fi2fi3 and vu. Here 
?;i2 and V13 reuse the previously recycled bucket 67, while node ?;i4 takes an unused bucket bs- |(d)| 
shows the resulting STS after removing vi and its descendants. Nodes vs and migrate into h 
and take names 2 and 3, respectively. Buckets 62, 67 and 65 are recycled accordingly. 



Procedure 6 (Naming and Coloring on fj,STS). Naming and Coloring convention for fiSTS is 
the one for STS plus the following. 

Nodes in an LS, from the head downwards, are assigned continuously increasing names, 
starting from the initial value of a bucket. 

When an LS is created, nodes in the LS are assigned names from an unused name bucket; 
when an LS is removed (which only happens when its head is removed), the name bucket of 
the LS is recycled. 

When an LS I is grafted into another LS I', the name bucket of I is recycled and nodes on I 
are renamed according to Rule (|6I1|) . as if they were on I' originally. 

IB^4 Renamed nodes are marked red. 

Procedure 7 (Improved Streett Determinization). Given a nondeterministic Streett automa- 
ton A = {Q,Qq,T,, A, {G, B)i) , this procedure outputs a deterministic Rabin automaton B = 
{Q,qQ,Tj, A.,[G,B\j) , where Q is the set of fiSTS, L = [l..n(/u + 1)], [G,B]j is as defined in 
Procedure \^ % is a tree that is just a fully grown left spine obtained by growing the single root tree 
(which is defined in Procedure^ according to Procedure ([5]), and A is defined such that q' = A{q) 
if and only if q' is the fiSTS obtained by applying the following transformation rule to q. 

^1 Subset Construction. For each node v in Q, update set label Ls{v) to A(Ls{v)). 
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^2 Expansion. Apply the following transformations to non-leaf nodes recursively from the root. 
Let V be a node with j children vi, . . . ,Vj with ii,. . . ,ij as the corresponding index labels. 
Consider the following cases for each j' G 



7^1 If Ls{vj') n B{ij/) / 0. Add a child v' to v with Ls{v') = Ls{vj/) n B{ij>) and Lh{v') = 
max([0..ij/) H ({0} U Mini{Lj^ (v)))) , and remove the states in Ls{vj') D B{iji) from Vji 
and all the descendants of Vji . 



7^2 If Ls{vj>) n B{ij,) = and Ls{vj>) n G{ij>) / 0. Add a child v' to v with Ls{v') = 
Ls{vji) n G{iji) and Lh{v') = Lh{v), and remove the states in Ls{vji) n G{iji) from vji 
and all the descendants of Vji . 

^3 Horizontal Merge. For any state q and any two siblings v and v' such that q G Ls{v) r)Ls{v'), 
if V -<pri v' , then remove q from v' and all its descendants. 

^4 Vertical Merge. For each v, if all children ofv have index label 0, then remove all descendants 
of v. 

5 Grow the tree fully according to Procedure dSJ . 

Note that besides naming and renaming, the only major difference between Procedures [7] and H] 
is the way of selecting the next positive obligation. In Step (I7121ip . Mini{Lj^ (v)) is used in the 
calculation, while in Step (I4l2l3lip . L^*^* {v) is used. 

Theorem 1 (Streett Determinization: Correctness). Let A = {Q,Qo,T,, A, {G, B)i) be a Streett 
automaton with \Q\ = n and I = [1..A;], and B = {Q,qQ,T,, A,[G, B]j) the deterministic Rabin 
automaton obtained by Procedure^ We have .^{A) = ^{B). 



5 Complexity 

In this section we state the complexity results for the determinization of Streett, generalized Biichi, 
parity and Rabin automata. The corresponding proofs are given in the appendix. 

Theorem 2 (Streett Determinization: Complexity). Let A = {Q,Qo,Ti, A, {G, B)i) be a Streett 
automaton with \Q\ = n and I = [l..k], and B = {Q,qo,T,, A,[G, B]j) the deterministic Rabin 
automaton obtained by Procedure^ If k = 0{n), then \Q\ = 2'^{"is"-+"'^'g'^) and \I\ = 0{nk). If 
k = uj{n), then \Q\ = 20("'ig") and |/| = 0{n^). 

Figure H] breaks down the total cost into five categories and compares our construction with 
previous ones in each category. It turns out that the complexity analysis can be easily adapted for 
generalized Biichi and parity automata as they are subclasses of Streett automata. 

Corollary 1 (Generalized Biichi Determinization: Complexity). Let A be a generalized Biichi 
automaton with state size n and index size k. There is an equivalent deterministic Rabin automaton 
B with state size 2'^("-'s"'^). The index size is 0(nk) if k = 0{n) and 0(n'^) if k = uj{n). 

Corollary 2 (Parity Determinization: Complexity). Let A be a parity automaton with state size n 
and index size k. There is an equivalent deterministic Rabin automaton B with state size 2*^("^s"') 
and index size 0{nk). 

tin Piterman's construction, the number of ordered trees times the number of name combinations is bounded 
by (nfc)"'°. However, the second factor is still the dominating one, costing 2"("'='snfe)^ Also, there is no notion of 
color in Piterman's construction. Instead, each tree is associated with two special names both ranging from 1 to nk, 
resulting in the cost 0((nfc)^). 
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Cost 


Safra's 


Piterman's 


Ours 


Ordered Tree 


20(nk) 


20{nk) 


20(nlgn) ^ ^ Q(2") 


Name 


20(nk Ig nk) 


20(nk lgnfc)t 


20(nlgn) ^ ^ Q(2") 


Color 


20(nk) 


0((nA:)2)t 


20(nigfe) A; = 0(n) 


2a(nlgn) k = uj{n) 


Set Label 


20(nlgn) 


20(n Ign) 


20(nlgn) /c = 0(2") 


Index Label 


20(nk Ig nk) 


20{nk Ig nk) 


20(nlgra+nA:lgfc) ^ _ Q^'^^ 


20(n-ign,) A; = w(n) 


Total 


20(nk Ig nk) 


20{nk Ig nk) 


20(nlgn+n/elgfe) ^ _ Qj^j^") 


20(nMgn) A; = a;(n) 



Figure 4: Cost breakdown of Streett determinization 



The determinization of a Rabin automaton with acceptance condition [G, B]j (the dual of 
{G,B)j for / = [1..A;]) can be straightforwardly obtained by running, in parallel, k modified Safra 
trees, each of which monitors runs for an individual Rabin condition [G{i), B(i)] {i € I). Figured] 
summarizes the determinization complexities for w-automata of common types. 

Theorem 3 (Rabin Determinization: Complexity). Let A he a Rabin automaton with state size n 
and index size k. There is an equivalent deterministic Rabin automaton B with state size 2'^("''^''s"-) 
and index size 0{nk). 

We note that it is unlikely that there exists a Safra-tree style determinization for Rabin au- 
tomata, because an analogue of Lemma [2] fails due to the existential nature of Rabin acceptance 
conditions. 

6 Concluding Remarks 

In this paper we improved Safra's construction and obtained tight upper bounds on state complexity 
for the determinization of Streett, generalized Biichi, parity and Rabin automata. Combining these 
with the lower bound results in |CZllaj and previous findings in the literature, we now have a 
complete characterization of determinization complexity of w-automata of common types. 

Our results show an interesting phenomenon that in the asymptotic notation 2®*^ -', comple- 
mentation complexity is identical to determinization complexity. The same phenomenon happens 
to finite automata on finite words. We believe it is worth investigating the reason behind this 
phenomenon. 

As mentioned earlier, determinization procedures that output parity automata, like Piterman's 
constructions, are preferable to the classic ones that output Rabin automata. We plan to investigate 
how to combine Piterman's node renaming scheme with ours to obtain determinization procedures 
that output parity automata with optimal state complexity. 
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A Proof for Theorem [T] 



Theorem [T] (Streett Determinization: Correctness). Let A = {Q,Qo,T,, A, {G,B)i) be a Streett 
automaton with \Q\ = n and / = and B = {Q,qQ,T,, A,[G, B]j) the deterministic Rabin 

automaton obtained by Procedure [71 We have ^{A) = ^{B). 

Proof. {^{A) C ^(B)). This amounts to showing that if ^3 = q^qi ■ ■ ■ is an run of B over an 
infinite word w = wqWi . . . G ^{A), then there exist i ^ I and j > such that a node v with 
name i exists in ah Qj and v turns green infinitely often. This part of proof is almost the same as 
the one in [S af92 j. The only complication comes from renaming. As mentioned before, nodes are 
identified with their names, not their positions (F- values). So we have the situation that v with 
name i exists in qj, but it is renamed to i' in the following state qj+i- This happens when v is a 
second left-most sibling in qj and the leftmost sibling of v is removed in ^j+i, resulting in v joining 
another LS in Qj+i. However, such "grafting" is one directional, in the sense that the value of 
V after a grafting is lexicographically smaller than the y-value before the grafting. Due to well- 
foundedness of lexicographical order, if v stays in each state of a suffix of then v eventually has 
a fixed name i. This means that g eventually never visits G'(i), but visits B{i) infinitely often, and 
hence w G (S). 

{^{B) C ^{A)). We ought to show that ii w = wqWi . . . G ^{B), then the run g = qoqi ■ ■ ■ of 
B over w induces an accepting run of A over w. The assumption that g is accepting means that 
there exists an i G / such that g eventually never visits G{i), but visits B{i) infinitely often, or 
equivalently, i names a green or yellow node in every state in a suffix of g and there are infinitely 
many occurrences when the nodes named by i are green. Since renamed nodes are marked red, all 
nodes named by i in the suffix have to be the same one. It follows that a node v eventually stays 
in every state in a suffix of g and v turns green infinitely often. The rest of the proof is the same 
as the one in |Saf92j , with the help of Lemma [2] and the fact that using Mini to select the index 
labels of the children of v is sound and complete. □ 

B Formal Analysis of Complexity 

To formally analyze the complexity of our Streett determinization, we first introduce a notion called 
increasing tree of sets (ITS) [CZllbj . Here we switch to an informal notation of labeled trees and 
we identify a node with the sequence of labels from the root to the node. 

Definition 4 (Increasing Tree of Sets {ITS) [CZllb] ). An ITS T{n, k, B) is an unordered I -labeled 
tree such that a node a exists in T{n,k,B) iff\/i G [l..|a|], a[i] G Mini{a[l..i)). 

By the definition an ITS is uniquely determined by parameter n, k and B. Beside that, 
several properties are easily seen. First, the length of the longest path in T(n, k, B) is bound by 
/i. Second, if /? is a direct child of a, then (3 must contribute at least one new element that has 
not been seen from the root to a. Third, the new contributions made by (3 cannot be covered by 
contributions made by another sibling with ties broken by selecting the one with smallest index. 
As i? : / — )• 2*^ is one to one, we also view ITS as 2'3-labeled trees. 

Example 2 {ITS). Consider n = 3, k = 4, Q = {qo,qi,q2}, o-nd B : [1..4] l9 such that 
B{l) = {qo,qi}, B{2) = {qo}, B {3) = {qi , q2} , i?(4) = fe} • 

Figure\5\ shows the corresponding T{n,k, B). 
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2 : {qo} 4 : {^2} 

1: {90,91} 4 : {(72} 3: {91,92} 2 : {qo} 

I I I 

3: {91,92} 1: {90,91} 2 : {90} 1: {90,91} 



Figure 5: The ITS T{n,k,B) in Example O Note that {qo,Qi} and {qi,q2} cannot appear in the 
first level because {qo,Qi} covers {qo} and {^1,52} covers {92}- The leftmost node at the bottom 
level is labeled by {qi,q2} instead of by {§2} due to the index minimality requirement. 

Now let T(n, k) denote the class of all ITS with fixed n and k. Let Tfi{n, k) be the maximum 
number of nodes an ITS in T(n, k) can have. 

Lemma 3 r |CZllbj l. Th{n,k) = 2^^''^^''^ for k = 0{n) andTh{n,k) = 20("ig") fork = uj{n). 

Proof. Note that for fixed n, k and G : / — )■ 2*^, T(n, k, G) is uniquely determined. Also note that 
the height of T(n, k, G) is bound by ^ = min(n, k) and the maximum branching is bound by k. 
Let [T(n, /c, G)| denote the number of non-root nodes in T{n, k, G) 

Case 1: k = 0{n). In this case we have /i = 0{k). Therefore, we have 

\T{n,k,G)\ <Y,k' < nkt" = 2^^''+t'^^'' = 2^^^^^^^ 
1=1 

Since G is chosen arbitrarily, we have Th{n,k) = 2^^^^^^\ 

Case 2: k = u{n). Let G' : I' ^ 2^ where I' = [l..k'] be an extension of G such that range{G') 
contains all singletons from Q, i.e., 

Vi € [l..k] G'{i) = Gii) , 
{{q} I q & Q} <^ range{G') . 

By Lemma m we assume without loss of generality that [l..n] name the n singletons such that 
G'{i) = qi-i for i € [l..n]. Because of existence of all singletons, the minimal extension at each 
node is always done by adding singletons. Formally, for any index sequence a, Mini{a) C [l..n]. 
Therefore, each nonempty path in T(n, k' , G') corresponds to a nonempty prefix of a permutation 
of [l..n] and vice versa. Note that each leave is of height n, T{n,k' ,G') has exactly n! nodes at 
both the second last level and the last level. For n > 2, each node above the second last level has 
at leat two children, which implies the number of nodes above the second last level is bound by n!. 
All in all, we have 

\T{n,k',G')\ < 3n\ = 2^("'g"). 

By LemmaO \r{n,k,G)\ < \r{n,k' ,G')\. As G is chosen arbitrarily, we have Th{n,k) = 2'^("is"). 

□ 

Lemma 4. Let B : I ^ 2'^ , B' : I ^ 2^ be two injective functions such that range{B) = range{B'). 
Then \T{n,k,B)\ = \T{n,k,B')\. 
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Proof. The condition range{B) = range{B') means that B and B' just name subsets of Q differ- 
ently. We extend B, B' to functions from /* to 2*5 such that for a £ I*, 



\a\ \a\ 

B{a) = y B{a[i]) , B'{a) = [J B'{a[i]) . 

i=l 2=1 

By Definition m children of a node a in an ITS is completely determined by B{a). So a node 
a in T(n,k,B) has the same number of children as a node a' in T{n,k, B') if B{a) = B'[a'). 
Moreover, if qi, . . . , are children of a and q'^, . . . , a'^ are children of a', Then i?(Qi), . . . , B{aj) 
are just a permutation of B' (a'l) , . . . , B' (a'j) . By induction on tree height, there is a one-to- 
one correspondence between nodes in T{n,k,B) and nodes in T{n,k, B'). Thus |T(n, = 

|r(n,A;,i?')l- □ 

Lemma 5. Let T{n, k, G) and T{n, k' , G') be two ITS such that G' extends G by naming a singleton 
that G does not name. Then \T{n,k,G)\ < \T{n,k' ,G')\. 

Proof. It suffices to show that \T{n, k,G)\ < \T{n, k', G')\ when k' = k + l, I' = [l..k'], G'{i) = G{i) 
for ? G / and G'{k+\) names a new singleton. Without loss of generality we assume G'{k+\) = {qo}. 
We show a tree transformation 9 that turns T{n, k, G) into T(n, A; -|- 1, G'). 

Recall that we identify a node with the path from root to the node. By a node a, we mean 
a[l], . . . , are labels on the path and is the label of a. Let Tq, denote the subtree rooted 

at a and G{a) be 

IjGHi]). 

i=l 

We say that a state g € Q is named by a if g G G{a). We define on a subtree Tq as follows. 
Case 1. The new state qq is named by a. In this case, 9{Ta) = T^. 

Case 2. The new state qo has not been named by a. Let us assume that {qo} is named by a's 
children ai, . . . ,aj (j =0 means no children names {qo}). Let aj+i, . . . , aji be the rest children of 
a. Formally, we have 

qo ^ G{a) , 

qoeG{ai){ie[l..j],j>0), 

qo^G{ai){iG[j + l..j'],j>0). 

We have two sub-cases to consider. 

Case 2.1. For some i € [l.-j], G{ai) U G{a) = {qo} U G{a). In this case we must have j = 1. 
Let 6{Ta) be a tree obtained from Tq, by replacing the label of ai by A; -|- 1. 

Case 2.2. For all i € [l..j], G{ai)UG{a) D {qo}UG{a). In this case let 9{Ta) be a tree obtained 
from Ta by the following procedure. 

1. Add to a a new leaf a' labeled with k + 1, 

2. Remove subtrees ai, . . . ,aj from a and make them children of a' 

3. Add to a' j' — j new leaves a'^, . . . , ct'j,_j, respectively, labeled with 
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4. Grow the new j' — j leaves to full ITS according to Mini defined with respect to n, k' and 
G'. 



In any case, 9{Ta) is an ITS with root labeled with a[|a|]. It is clear from the above procedure 
that \9{Ta)\ > \Ta\- Now let Q{T) a tree obtained from T by applying 9 on T level by level, 
from top to bottom. Example [3] shows such a transformation. It is not hard to verify that 
T{n, k', G') = e(r(n, k, G)). Therefore, we have \T{n, k,G)\< \T{n, k', G')\. □ 

Example 3. Consider the same setup: n = 3, k = 4, Q = {qo,Qi,Q2}, and B : [1..4] 2*^ such 
that 



B{l) = {qo,qi} 



B{2) = {go} 



i?(3) = {91,92} 



5(4) = {92} 



as in Example\M Let Tq = T{n,k, B). Let k' = 5, B' = B U {(5, {91}). Applying 9 to the root of 
Tq we have Ti: 



5 : {qi} 2 : {go} 4 : {93} 

2: {go} 4: {92} l-{qo,qi} 4 : {92} 3:{<?i,<?2} 2 : {qo} 

III I I I 

4 : {92} 2 : {qq} 3 : {(71,92} 1 : {qo,qi} 2 : {qq} 1 : {qo,qi} 



Applying 9 to nodes (2) and (4) in Ti, we have T2: 




5 : {qi} 2 : {go} 4 : {92} 

2: {go} 4: {52} 1 : {^i} 4 : {92} 3 : {gi} 2 : {go} 

I I I I I 

4:{g2} 2: {go} 3:{gi,g2} l:{go,5i} 2 : {go} l:{go,9i} 



Applying 9 to nodes (2,4) and (4,2) in T2, we have T^: 



5 : [qi] 2 : {go} 4 : {g2} 

2: {go} 4:{g2} 1 : {gi} 4 : {g2} 3 : {gi} 2 : {go} 

I I I I I 

4:{g2} 2: {go} 3:{gi,g2} 1 : {gi} 2 : {go} 1 : {gi} 

And no more application of 9 is possible. It is not hard to verify that T3 = T(ra, fc + 1, B'). 

Theorem [2] (Streett Determinization: Complexity). Let A = {Q,Qo,Ti, A, {G,B)j) be a Streett 
automaton with |Q| = n and / = [l-.k], and B = (Q, 90, A, [G, the deterministic Rabin 
automaton obtained by Procedure [3 If k = 0{n), then |Q| = 20i^^sn+nkigk) |j| ^ o{nk). If 
k = Lo{n), then |Q| = 2^("''e") and |I| = 0{n^). 
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Proof. As mentioned before, a fiSTS has at most n{p, + 1) nodes. Let T(e, /) denote the number 
of ordered trees with e edges and / leaves. T{e,l) are called Narayana numbers, which for e,/ > 1, 
assume the following closed form [Nar591 IDZ80] : 

Let OT{n,fj,) be the set of ordered trees with at most n leaves and n{fi + 1) nodes. We have 

n-{fi+l) min(n,e) 



|OT(n,^)|< E T{e,l)<n^-Tin\n)<nr ]( ] 

e=l 1=1 V / V"- / 



20(12 Ign) 



A fiSTS is a tree in OT{n,fi) augmented with four components: name, color, set label, and index 
label. By Condition (12120 . the set labels of nodes are completely determined by the set labels of 
leaves. There are at most n" ways to label leaves, each of which corresponds to a function from 
[l..n] ^ Q. 

Let us consider index labels on each ^5*. As stated before, only the head of an LS can be 
labeled by 0. As each LS is fully grown, the sequence of index labels from the leaf of an LS to the 
head of the LS corresponds to a path in the ITS from a leaf going upward. So for each LS, there 
are at most Th{n,k) ways to do index labeling. Therefore, by Lemma [3l the number of ways of 
index labeling a ^lSTS is bounded by (T?,(n,fc))", which is 2^('^^ig'=) if /fc = 0{n) and 20("''e'^) if 
k = a;(n). 

The names along an LS from the head downwards is continuously increasing from the initial 
value of a bucket. So they are completely determined once a bucket is chosen. There are at most 
n buckets, and therefore the number of ways to name a fiSTS is bounded by n! = 2'-^("^s"). 

The colors on an LS also have some regularity. It is not hard to see that there is at most one 
green node in an LS and all red nodes must be descendants of yellow nodes. So there are at most 
+ 1)^ choices to color an LS and hence the number of color combinations on a fiSTS is bounded 
by (/i + 1)2", which is 20("'e^) if A: = 0(n) and 2°("ig") if A; = a;(n). 

Put all together, we have state complexity 2'^("ig"+"'=ig*^) if /t = 0{n) and 2'^("is") if k = 
u}{n). Figure [4] compares the costs attributed by each component of STS between the previous 
constructions and ours. □ 

Corollary [1] (Generalized Biichi Determinization: Complexity). Let ^ be a generalized Biichi 
automaton with state size n and index size k. There is an equivalent deterministic Rabin automaton 
B with state size 

20(nignfc)^ The index size is 0{nk) ii k = 0{n) and 0{v?) li k = uj{n). 

Proof. Let A = {Q,Qo,T,, A, {B)j). The acceptance condition {B)j can be viewed as a Streett 
condition {G, B)j where G{i) = Q for every i & I. In this case, the ITS is just a one-level tree (not 
counting the root). Th{n, k) is then bounded by k and the number of ways of putting index labels 
is bounded by {Th{n, /c))" = A;". Other complexity factors are all bounded by n'^^^\ So the total 
number of fiSTS is bounded by k"" ■ = 2^("is"^). □ 

Corollary [2] (Parity Determinization: Complexity). Let ^ be a parity automaton with state size 
n and index size k. There is an equivalent deterministic Rabin automaton B with state size 2*^("''s"') 
and index size 0{nk). 

Proof. Let A = {Q, Qq, S, A, {G, B)j). Because i?i C • • • C B^, the ITS is just a directed line. As 
a result, there is no index combinations and other complexity factors are all bounded by n^^"'\ So 
we have 2<^("ig"). □ 
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Theorem [3] (Rabin Determinization: Complexity). Let ^ be a Rabin automaton with state size n 
and index size k. There is an equivalent deterministic Rabin automaton B with state size 2'-'("'^'§"') 
and index size 0{nk). 

Proof. This theorem is a folklore, but here we sketch a proof just for the sake of completeness. Let 
A = (Q, Qo) ^1 [G, B\i). For each fixed i, Ai = (Q, Qq, S, A, [G(i), -B(i)]) is a Rabin automaton. 
Ai can be transformed into a Biichi automaton in 0(n) state, which is equivalent to a deterministic 
Rabin automaton Bi with 2*-^("^s") states. The desired B is just a product of {Bi)i^i. Let ;Sj = 
(Q«,gJ*\s,Ai,[G»,5«]^„). We defineS= (g,go,S,A,[G,^]j) where 

Q = q(i) X ••• xQ(^), 

A((g(i), . . . , a) = (Ao(g(i), a), . . . , Afc(g«, a)), 
/ = l(i) X {1} U • • • U /('^^ X {k}, 
G{{ij,i)) = X ••• X G^'\ij) X ••• X qW, 
B{{ij,i)) = X ••• X X ••• X Q^^^- 

It is easily to verify that B is deterministic, ^{B) = ^{A), the state size of ^ is 2^("^'g") and 
the index size of B is 0{nk). □ 
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